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1.  INTRODUCTION 


1.1.  History 

This  is  a  report  of  the  third  JASON  summer  study  of  advanced  computing  techniques  that 
could  lead  to  a  radical  improvement  in  computer  performance.  The  purpose  of  these  studies  is  to 
identify  trends,  indicators,  and  potential  techniques  that  might  someday  lead  to  such  perfor¬ 
mance.  In  the  first  study  we  investigated  Residue  Arithmetic  and  Symbolic  Computingflj.  In  the 
second  year  we  extended  our  study  of  symbolic  computing  into  Source  Program  Transformation 
and  also  investigated  a  new  topic,  Reversible  ComputingfSj. 

1.2.  Scope 

In  this  report,  we  explore  two  topics  suggested  from  the  last  study.  The  first  is  Molecular 
Computing  which  may  offer  a  technology  that  could  realize  some  of  the  ideas  of  reversible  comput¬ 
ing.  The  second  topic  is  the  Theory  of  Transformation  of  Weak  Methods  into  Strong  Methods. 
This  is  a  direct  extension  of  our  prior  Source  Program  Transformation  work. 

Since  the  goal  of  this  and  all  our  previous  work  is  to  identify  potential  new  developments, 
the  ideas  discussed  herein  are  quite  speculative  and  should  be  considered  with  some  caution. 

We  limit  our  investigation  to  improvements  in  computer  performance  in  the  execution  of 
only  difficult  calculations.  We  will  not  seek  higher  performance  for  problems  easily  managed  by 
today’s  machines. 

1.3.  Difficult  Problems 

In  our  view,  difficult  problem  domains  are  characterized  by  these  factors: 

(1)  Massive  numerical  calculations  on  large  data  structures. 

(2)  Search  for  an  optimal  (or  near  optimal)  solution  over  a  problem  space. 

(3)  Symbolic  calculations,  tightly  coupled  to  the  numeric  calculations  and  the  search  process. 
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Problems  of  this  type  demand  a  computer  architecture  with  massive  main  memory,  fast 
digital  circuits  and  special  structure  for  supporting  symbolic  and  search  calculations. 

1.4.  Limitations  to  High  Performance 

The  rate  at  which  computers  can  execute  is  limited  by  the  serial  nature  of  today’s  computer 
programs,  the  strength  (or  efficiency)  of  the  algorithms  employed,  and  the  physical  size  of  the 
components. 

The  joint  problem  of  overcoming  the  serial  nature  of  todays  computing  tasks  and  the 
expressing,  scheduling  and  executing  parallel  calculations  is  one  of  the  great  unsolved  problems  of 
computer  science.  While  it  is  very  important,  we  have  chosen  not  to  address  ii  in  this  particular 
report.  We  note  however,  that  the  language  Prolog,  employed  ir  this  report,  has  great  potential* 
for  parallel  execution[3|. 

We  will  discuss  the  theory  of  strength  improvement  in  some  detail.  This  is  but  one  aspect 
of  the  efficiency  problem,  however  it  is  the  most  important  for  the  problem  domain  we  have  in 
mind. 

The  primary  limitation  to  circuit  speed  is  the  finite  speed  of  light  (  Maxwell’s  equations  ) 
and  the  need  to  dissipate  heat.  Smaller  components  generate  less  heat  and  therefore  can  be 
packed  closer  together,  decreasing  the  time  it  takes  to  propagate  signals  between  themselves. 
Smaller  computers  can,  all  other  factors  being  equal,  run  proportionally  faster  than  larger  ones. 
Today,  the  basic  component  of  computers,  the  logical  switch,  is  the  transistor.  If  this  switch 
could  be  reduced  in  size  from  about  10"8  meters  to  a  molecular  scale  of  about  KT8  meters,  a 
speed-up  of  perhaps  as  much  as  1000  times  might  be  achievable.  The  possibility  of  this  potential 
speed-up  motivated  this  study  of  molecular  computing. 


la  modern  electronic  devices,  response  time  scales  linearly  with  site  (See  p.  33  of  [4]  ).  However  the  main  argument 
for  linear  speedup  is  the  simple  argument  that  to  first-order,  transit  time,  across  a  device,  is  proportional  to  device  site, 
whatever  the  device 


rV. '  ■.  i  .•  VvVvVV- 

k/lv:iyk.vl  ii  .• 


-\ 


Radical  Computing  III 


3 


2.  MOLECULAR  COMPUTING 

2.1.  Introduction 

Feynman[5]  may  have  been  the  first  to  seriously  consider  the  prospects  of  building  comput- 
ers  on  a  molecular  scale.  He  observed: 

....if  computers  had  millions  of  times  as  many  computing  elements  as  they  do  now,  they  could 
make  such  judgements  (  as  in  image  recognition  ).  But  with  the  present  size  of  components,  these 
computers  would  fill  millions  of  rooms,  cost  impossible  sums  of  money,  and  be  too  big  to  work  an¬ 
yhow.  (In  the  ’smaller’  computers)  ...  the  wires  should  be  ten  or  100  atoms  in  diameter,  and  the 
circuits  should  be  a  few  thousand  Angstroms  across. 

Thus,  the  basic  idea  of  molecular  computing  is  to  radically  reduce  (i.e.  to  molecular  scales)  the 
size  of  the  components  that  make  up  today’s  computers.  This  means  that  the  finest  features, 
today  about  1(T®  meters  would  be  reduced  to  about  10“9  meters.  Thus  switches,  which  today  are 
about  10~*  meters  across,  might  be  reduced  to  the  range  of  10-7  to  10-8  meters.  If  such  molecular 
components  could  be  developed,  then  perhaps  up  to  1018  bits/cms  of  storage  and  down  to  1012 
seconds/gate  of  delay  time  might  be  achieved.  Today,  VLSI  circuits  achieve  about  10®  bits/cm3 
and  about  I0  fl  seconds/gate.  It  is,  of  course,  the  potential  for  this  dramatic  improvement  in 
storage  density  and  circuit  performance  that  motivates  this  study.  The  reader  should  be  cau¬ 
tioned  however,  that  the  ideas  on  molecular  computing  are  speculative.  There  is  very  little  exper¬ 
imental  or  theoretical  work  to  support  most  of  the  interesting  ideas. 

The  modern  ideas  of  Molecular  Electronic  Devices  (MED)  are  the  primary  source  of  ideas  for 
molecular  computers.  These  have  been  nurtured  by  Forrest  L.  Carter  of  the  Navel  Research 
Laboratory  since  1979.  He  has  written  extensively  on  the  ideas[6,7, 8,9, 10, 11,7, 12]  He  has  also 
sponsored  several  workshops  on  Molecular  Electronic  Dev  ices.  [13, 7).  There  is  a  wealth  of 
material  available  from  these  workshops  and  in  the  papers  referenced  by  the  workshop  papers. 
The  reader  is  encouraged  to  examine  these  materials,  although  we  will  mention  topics  covered  in 
these  materials  only  so  far  as  they  apply  to  the  issues  we  discuss  here. 


Radical  Computing  III 


4 


2.2.  Approaches 

There  are  a  number  of  approaches  to  molecular  computing: 

(1)  Soliton  propagation 

(2)  Conduction  electrons 

(3)  Super  conduction 

(4)  "Neuron-like”,  biological  based  elements 

(5)  Chemical  reactions,  similar  to  DNA  etc.  reactions. 

We  will  discuss  the  first  three  possibilities.  We  will  not  consider  the  chemical  or  direct  bio¬ 
logical  approaches,  such  as  nerve  conduction,  because  they  are  too  slow  1  It  is,  however,  possible 
to  employ  biological  assembly  techniques  for  "fast”  non-biological  circuits. 

2.3.  Construction  Techniques 

Feynman[5, 14]  also  discussed  possible  construction  techniques  in  his  pioneering  papers.  In 
particular  he  said[5j 

One  possibility  we  might  consider,  since  we  have  talked  about  writing  by  putting  atoms  down  in  a 
certain  arrangement,  would  be  to  evaporate  metals  or  other  materials  in  successive  layers  until  we 
have  a  block  of  exceedingly  fine  dimensions. 

...  The  electrical  equipment  won’t  simply  be  scaled  down;  it  has  to  be  redesigned.  But  I  can  see 
no  reason  why  it  can’t  be  redesigned  to  work  again. 

We  concur.  There  seem  to  be  several  possible  avenues  for  molecular  construction: 

(1)  Lithography 

(2)  Modular  Chemistry 

(3)  Monolayer  Films 


1  The  response  time  of  nerve  cells  ('1CT2  sec)  is  very  slow  compared  to  that  of  modern  electronic  circuits  (‘10"8 
seconds).  Of  course,  if  the  fast  circuits  are  not  realisable,  then  the  chemical  and  nerve  conduction  approaches  should  be 
investigated  From  where  technology  now  stands,  it  appears  that  the  fast  circuit  techniques  are  no  more  difficult  than  any 
of  the  alternatives,  and  therefore  become  the  choice  for  current  investigation. 
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(4)  ’Self-Assembly’  (Biological  Techniques) 

The  reader  is  referred  to  the  workshop  papers  mentioned  above  for  details  of  all  these  tech¬ 
niques.  There  are  undoubtedly  other  possible  approaches  that  might  be  applied  if  these  techniques 
will  not  suffice.  The  important  point  is  that  there  seem  to  be  a  number  of  possible  techniques 
that  could  possibly  be  developed  to  assemble  molecular  computers. 

2.4.  Machine  Organisation 

2.4.1.  Fault  Tolerance 

The  machine  organization  of  molecular  computers  is  likely  to  be  very  different  from  today’s 
computers  due  to  the  very  high  failure  rates  that  must  be  expected  for  molecular  components. 
Cosmic  rays  for  example,  unstoppable  with  any  practical  shielding,  will  regularly  destroy  portions 
of  the  molecular  circuits.  Manufacturing  techniques  are  not  now,  nor  are  they  ever,  likely  to  be 
perfect,  and  repairs  will  perhaps  be  impossible.  Thus  a  great  redundancy  of  molecular  circuits 
must  be  provided.  Error  (i.e.  failure)  detection  and  correction  circuits  will  be  needed  at  all  levels 
of  molecular  circuits  and  components.  This  is  a  reasonably  well  understood  technical  problem 
and  does  not  pose  any  unsurmountable  difficulties.  Since  this  is  not  a  critical  issue,  we  will  not 
discuss  error  and  failure  problems  any  further  in  this  report. 

2.4.2.  Finite  State  Automata 

In  other  respects,  the  organizational  principles  of  today’s  computers  should  be  applicable  to 
molecular  computers.  An  exception  might  occur  if,  for  some  currently  unseen  reason,  some 
molecules  are  found  that  are  very  efficient  finite  state  machines  that  can  be  organized  into  an 
array  of  cellular  automata.  Array  cellular  automata,  at  least  as  they  are  understood  today,  are 
inefficient  organizations  for  doing  calculations  and  would  not  be  the  first  choice,  all  other  factors 
being  equal. 
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2.4.3.  Circulating  Loops 

The  possibility  of  obtaining  mass  storage  using  molecular  components  is  intriguing.  Early 
digital  computers  used  circulating  loop  delay  lines  as  both  main  and  secondary  (mass)  stores. 
These  were  acoustic  delay  lines  of  mercury,  etc.  Modern  versions  of  "circulating  loop  stores”  are 
the  magnetic  bubble  devices  and  the  charge-coupled  devices  of  modern  electronics.  There  seems 
to  be  a  possibility  that  "circulating  loop  stores"  on  a  molecular  scale  might  be  realizable.  In  par¬ 
ticular,  as  discussed  below,  solitons  can  propagate  down  a  single  molecular  chain  at  about  the 
speed  of  sound.  Thus  such  a  molecular  chain  might  serve  as  a  circulating  loop  store  for  a  molecu¬ 
lar  mass  memory.  Since  the  propagation  is  a  mechanical  distortion,  it  is  subject  to  all  mechanical 
(thermal  and  other)  disturbances  and  so  would  need  to  take  place  in  a  cooled  environment. 

2.5.  Molecular  Components 

There  are  a  number  of  computer  components  that  must  be  implemented  in  molecular  form 
if  a  molecular  computer  is  to  be  realized. 

(1)  Wire 

(2)  Logical  Switches 

(3)  Flip-flops 

(4)  Random  access  memory 

(5)  Mass  memory  (serial  access  acceptable) 

(6)  Input  interface,  electronic  to  molecular  level 

(7)  Output  interface,  molecular  to  electronic  level 

There  are  other  components  (such  as  optical  image  i/o)  that,  although  desirable,  are  not 
necessary.  We  will  not  consider  these  components. 
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2.0.  Wire 

The  function  of  wire  is  to  interconnect  other  components  by  transmitting  signals  over  a  dis¬ 
tance,  roughly  the  diameter  of  the  molecular  computer,  (excluding  the  mass  memory).  Thus,  it 
will  be  necessary  that  this  ’wire’  reliably  transmit  signals  over  a  distance  of  about  10”6  meters.  If 
electromagnetic  phenomena  is  employed  for  signal  transmission,  this  implies  a  time  delay  of  about 
10”*  x  10”*  or  10“18  seconds,  which  is  plenty  fast  compared  to  the  speed  of  the  proposed  logic  ele¬ 
ments  (10~12  seconds).  Propagation  phenomena,  to  be  used  for  transmission  of  signals,  should  thus 
have  a  propagation  velocity  comparable  to  the  speed  of  light,  (3  x  10s  m/s). 

Four  mechanisms  for  signal  transmission  have  been  suggested  in  the  literature.  We  have 
not  been  able  to  think  of  any  additional  ones.  The  mechanisms  are: 

(1)  Light  propagation  in  channels,  fibers,  and  free  space. 

(2)  Soliton  propagation  in  molecular  chains. 

(3)  Classic  (electrical)  conduction  in  molecular  chains. 

(4)  Super  conduction  (electrical)  in  molecular  chains. 

2.0.1.  Light  Propagation 

The  use  of  light  as  a  signal  transmission  mechanism  is  an  old  idea.  There  seem  to  be 
several  molecular  possibilities  for  generating  and  receiving  the  light  signals.  For  example,  Han- 
son(l5]  discusses  the  prospects  and  problems  of  utilizing  Frenkel  excitons  in  several  ways,  one  of 
which  is  to  generate  controlled  phosphorescence  and  launch  a  light  signal  into  a  thin  film  optical 
wave  guide. 

The  main  difficulty  with  all  light  propagation  systems  is  the  inefficiency  inherent  in  convert¬ 
ing  into  and  out  of  the  light  system.  Hansen  recognizes  this  problem  but  states 

This  problem  can  be  circumvented  by  using  light  of  the  optimal  wavelength  and  by  having  the 
desired  process  occur  on  a  faster  time  scale  than  the  dissipative  process. 

Hansen  recognizes  that  his  general  approach,  which  includes  light  propagation,  is,  in  his  words, 
"extremely  speculative".  We  agree.  In  fact  it  seems  to  us  that  light  propagation  is  not  the  most 
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attractive  option  to  pursue  for  realizing  ’wire’.  We  base  this  conclusion  on  the  lack  of  any  solu¬ 
tion  to  the  efficiency  problem  mentioned  above. 

2.0.2.  Solltona 

There  are  several  potential  uses  of  solitons  as  carriers  of  signals  in  the  wire,  delay  lines, 
switches  and  i/o  conversion  devices  of  molecular  computers.  Here  we  will  only  summarize  one 
aspect  of  solitons,  their  use  as  signals  over  ’long'  ("IQ-8  m)  distances.  la  this  context,  it  is  gen¬ 
erally  assumed  that  the  word  ’soliton’  refers  to  the  non-linear  topological  excitation  of  a  long 
molecular  chain  such  as  polyacetylene,  the  solitons  being  the  bond-alternation  domain  walls.  Soli¬ 
ton  propagation,  in  effect  then,  a  propagation  of  a  configuration  (mechanical)  change,  is  limited 
by  the  speed  of  sound  in  the  material,  about  l(T*c.  Over  the  distances  we  have  in  mind  for  ’wire’ 
(HP8  meters)  this  corresponds  to  about  10-8  seconds.  This  is  far  too  slow  for  our  purposes.  As  a 
result,  the  use  of  solitons  to  carry  long-range  signals  is  not  attractive,  and  so  will  not  receive  any 
further  attention  in  this  report. 

Solitons  may,  however,  be  important  for  conducting  signals  over  short  distances  in  other 
specialized  circuits.  Thus,  solitons  cannot  be  entirely  dismissed  as  signal  carriers. 

2.8.3.  Classic  Conduction 

In  today '8  computers,  wires  range  in  length  and  materials,  as  indicated  in  Table  1.  At  best 
resistivity  is  about  10~*  Ohm-cm  for  wire  materials.  The  resistance  of  molecular  wires  can  only  be 
crudely  estimated,  but  for  a  wire  of  copper,  of  length  10~*  cm,  diameter  of  10"®  cm  the  resistance 
is  about  1000  Ohms  or  so,  barely  within  the  range  of  acceptable  resistances.  Therefore  it  will  be 
difficult  to  employ  the  usual  conductors  of  higher  resistivity  as  shown  in  Table  1. 
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TABLE  1:  Properties  of  ’Wires’  Used  in  Computers 


Material 


Utilization 


1.73  *  10-* 

10‘2  -  10"° 

lo-4  -  ltr3 

io-*  -  10° 

Back  Planes,  PC  Boards 

Gold 

2.44  *  IO"8 

IQ-*  -  10"* 

lo*4  -  lcr4 

io-*  -  io-‘ 

Bonding  Wires,  I.C.’s 

Aluminum 

2.62  *  10^ 

% 

I 

y 

o 

io-*  -  io-* 

lo-*  -  io1 

Lead  Frames,  I.C.’s 

U 1 

1.00  *  10-* 

10‘7  -  10"4 

10-*  -  io-* 

10“*  -  10s 

I.C.’s 

Silicon 

8.5  *  10*4 

To 

1 

*r 

o 

IQ-*  -  IO-4 

10-2  -  10* 

I.C.’s 

2.0.4.  Super  Conductors 

Because  of  the  difficulties  of  other  possibilities  for  wires,  superconducting  wires  look  attrac¬ 
tive.  There  are  several  possible  molecular  superconductors.  We  will  mention  only  a  well  known 
one,  (57V),  ,  polysulfur  nitride. 

Polysulfur  nitride  was  the  first  polymeric  metal,  having  been  discovered  in  1910(16] .  It  was 
discovered  to  be  superconducting  only  in  1975.  In  the  past,  there  has  been  some  controversy 
about  the  transition  temperature.  It  is  now  thought  that  the  transition  temperature  for  individual 
fibers  is  0.28  K  but  when  bundles  of  non-perfect  fibers  are  measured,  the  conductivity  depends 
upon  interactions  between  many  fibers  and  the  transition  temperature  then  varies  from  0.26  to 
0.28  K(16], 

This  material  can  be  formed  into  single  cbains[9]  of  the  form: 
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Actual  signal  paths  would  probably  employ  a  number  of  parallel  chains,  mostly  for  reliability. 
Superconductors  seem  the  best  possibility  for  realizing  wires  in  the  molecular  computer. 

2.7.  Logical  Switches 

The  best  possibility  for  a  molecule-sized,  logical  switch  (molecular  gate)  seems  to  be  a  an 
electron  tunneling  device.  According  to  Carter[9]  this  idea  originated  with  Pschenichnov  who  sug¬ 
gested  that  the  transmission  coefficient  of  an  electron  approaching  a  finite  series  of  potential  bar¬ 
riers  would  be  unity  if  the  electron  energy  exactly  matched  the  pseudostationary  potential  of  each 
barrier.  Of  course  any  perturbation  of  any  of  the  barriers  would  forestall  any  electron  conduction. 
Thus  if  a  stack  of  molecules  served  as  the  series  of  potential  barriers,  then  an  electrical  charge 
arriving  at  the  edge  of  one  element  of  the  stack  would  prevent  electron  conduction  from  a  power 
source  to  ground.  Since  the  voltage  of  such  a  power  supply  can  be  easily  controlled,  the  matching 
of  the  electron  energy  to  the  stack  potentials  should  be  possible.  It  seems  feasible  to  use  (SN)X  as 
’wire'  between  such  devices.  Figure  1  illustrates  this  idea  to  realize  a  NOR  gate. 

2.8.  FUp-Flopn 

The  best  method  to  implement  flip-flops  in  molecular  circuits  seems  to  be  the  same  as  in 
modern  digital  electronics:  Make  them  from  gates. 

2.0.  Random  Access  Memory 

The  creation  of  a  molecular  random  access  memory  (RAM)  can  be  approached  as  it  is  in 
today’s  VLSI  circuits,  that  is  ,  build  them  from  gates.  However  in  the  molecular  world  there  is 
likely  to  be  a  much  better  approach  possible  by  taking  advantage  of  special  molecules  that  can  be 
much  more  efficient  in  storing  a  bit  as  a  charge-controlled,  conformational  change.  This  idea  is 
even  more  speculative  than  the  NOR  gate  idea  discussed  above. 


f.  -■ 
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Radical  Computing  III  IS 

3.  TRANSFORMATION  OF  METHODS 

3.1.  Introduction 

In  computer  programming,  as  in  general  mathematics,  often  a  method  conceived  to  solve  a 
problem  is  relatively  inefficient  compared  to  what  is  possible.  Consider  the  calculation  of  the 
Discrete  Fourier  Transform  (DFT).  Only  recently  was  the  Fast  Fourier  Transform  (FFT) 
discovered,  despite  the  simplicity  of  the  DFT  and  the  large  performance  improvement  offered  by 
the  FFT.  If  we  could  find  a  general  method  of  transforming  weak  methods  like  the  DFT  into 
strong  ones  like  the  FFT,  a  radical  improvement  of  performance  would  result.  Thus,  ultimately 
we  seek  a  general  computer  program  that  can  accept  a  weak  method  and  produce  a  strong  one 
from  it.  Generally,  this  is  a  very  difficult  and  open  problem.  In  this  report  we  will  discuss  some 
approaches  to  this  problem  and  will  consider  a  small,  restricted,  example  of  such  a  program  which 
we  developed  to  aid  our  understanding  of  this  problem. 

3.2.  Approach 

The  general  idea  if  to  start  with  a  general,  weak,  method  for  solving  problems  in  a  domain, 
we  then  use  the  experience  in  problem  solving  (and  analysis  of  available  acquired-knowledge  in 
the  domain)  to  formulate  a  stronger  solution-  finding  procedure.  Whenever  possible,  we  find  spe¬ 
cialized  and  very  strong  procedures  for  subdomains  of  the  domain. 

There  is  an  increasing  amount  of  A. I.  research  in  transforming  weak  methods  into  strong 
ones:  Mostow[l7],  Mitchell)  18,  19),  Amarel[20, 21],  Anzai  and  Simon[22).  There  is  similar  work  in 
the  Soviet  Union:  (see  publication  by  Glushkov’s  group  in  Kiev  in  mid-seventies  on  learning  stra¬ 
tegies  for  solving  planning  problems).  Bundy’s  algebraic  manipulation  problem  (in  the  algebraic 
manipulation  system  called  PRESS)|23j  provides  a  good  vehicle  for  assessing  this  approach  and 
for  identifying  key  research  issues. 

Bundy ’8  problem  can  be  formulated  as  follows: 

(1)  Consider  the  domain  of  R-elementary  expressions  (given 
in  Bundy,  p.  207)[23], 
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and  the  axioms  of  algebra  (commutativity,  associativity, 
distributivity,  existence  of  inversion]. 

(2)  Given  an  equation  in  the  form  of  an  R-elementary  expression, 
in  a  single  unknown,  x. 

(3)  Solve  the  equation,  i.e.,  find  an  expression  which 

(i)  is  equivalent  to  the  given  equation 

(ii)  has  the  form 

x  =  r[«], 

where  the  right  side  is  a  term  from  the  language 
of  R-elementary  expression  that  does  not  contain  x. 

(iii)  T  is  'simple'  (where  'simplicity'  is  interpreted 
as  'structural  simplicity’  ot  the  term). 

Now,  the  problem  can  be  approached  by  a  conventional  weak  method  of  forward  search. 
This  method  starts  with  the  given  equation  and,  after  successively  applying  algebraic  axioms, 
transforms  the  equation  into  a  form  which  is  acceptable  as  a  solution. 

Bundy’s  paper  focuses  on  a  stronger  method  (  which  can  be  viewed  as  an  expert  method  in 
this  task  domain)  where  a  procedure  is  introduced  with  a  considerable  amount  of  structure. 

This  procedure  consists  of  five  major  methods.  These  methods  are: 

Normalize, 

Isolate, 

Collect, 

Attract, 

Tidy. 


Each  of  these  methods  utilizes,  in  a  specialized  manner,  subsets  of  the  algebraic  axiom.  With 
each  method  there  is  an  associated  set  of  applicability  conditions. 
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The  Normalize  Method  puts  the  initial  equation  into  a  normalized  form.  Its  applicability 
condition  is  the  presence  of  a  non-standard  symbol.  For  example,  Normalize  converts  the  expres¬ 
sion  A/B  into  A*B'(-1),  which  "looks”  worse,  but  eliminates  the  need  for  the  division  symbol 
Thus  this  form  is  actually  better  relative  to  the  total  number  of  symbol  types  needed  to  express 
all  equations. 

The  Isolate  method  strips  out  connectives  from  the  term  that  contains  x  in  the  expression 
under  consideration,  and  eventually  brings  the  expression  to  a  form: 

F  :  z  =  T\tx\ 

The  Isolate  method  is  applicable  only  if  the  expression  F  has  a  single  occurrence  of  x. 

The  Collect  method  reduces  the  number  of  occurrences  of  x  in  an  expression  under  con¬ 
sideration;  of  course,  this  method  is  applicable  only  if  the  number  of  occurrences  of  x  in  an 
expression  F  is  larger  than  one.  (N(x,F)  >  1). 

The  Attract  method  transforms  an  expression  F  so  that  it  has  a  form  on  which  the  collect 
method  can  apply.  Eventually,  the  Attract  method  consists  of  ’bringing  closer’  or  ’grouping’ 
terms  in  x.  Here  again,  the  method  applies  only  if  N(x,F)  >  1. 

The  Tidy  method  reduces  the  structural  complexity  of  any  term.  This  method  is  applied 
wherever  possible. 

Now,  the  overall  Solution  method  consists  off  applying  in  sequence  Normalize,  Attract  (if 
needed),  Colled,  and  Isolate.  Throughout  this  process,  Tidy  is  applied  whenever  it  is  applicable. 

The  interesting  issue  is  understanding  the  transition  from  the  weak,  flat,  solution  method  to 
the  structured  expert  method.  What  is  involved  in  this  transition?  What  are  the  bodies  of 
knowledge  needed  a  priori  to  effect  the  transition?  What  representation  and  process  can  be  used 
to  mechanize  the  transition?  What  are  the  key  open  problems  in  this  area? 

Consider  a  restricted  domain,  with  expression  from  a  restricted  language,  L,  where  the  only 
1-ary  function  symbols  are  ’square’,  ’squareroot’,  and  the  2-ary  function  symbols  are  ’+  ’, 
*,  /.  (This  means  that  trigonometric  and  exponential  functions  are  excluded).  The  restricted 
domain  permits  some  simple  experiments  in  the  domain. 


Radical  Computing  III 

Now  consider  applicability  conditions  for  elements  (moves)  of  the  Isolate  method. 
ture  of  the  current  expression,  F,  can  be  represented  in  the  form  of  a  tree,  as  follows: 


i 


IS 


1 


* 


! 


The  struc- 


F: 

fU  *  12 

Now  let  F j  be  the  largest  subterm  of  F  containing  x.  We  represent  Fx  by  the  subtree  of  F  which 
is  rooted  at  +  .  Also,  Fxx  is  the  ’child'  of  Fx  that  contains  x,  FX2  is  the  ’sibling’  of  Fn,  F2  is  the 
’sibling’  of  Fx.  N(x,F)  =  1  :  This  is  the  main  applicability  condition  for  the  Isolate  method;  it 
states  that  the  number  of  occurrences  of  x  in  F  is  1. 

Now,  if  the  conditions  apply,  then  by  applying  the  ’inverse  of  +  ’,  i.e.,  by  adding  -F 12  to 
both  sides  of  the  equation  we  obtain  a  reduction  by  1  of  the  depth  of  x  in  F  (which  is  the  goal  of 
application  of  steps  in  the  isolate  method). 

Thus,  the  effect  of  the  move  can  be  seen  in  the  form  of  the  transition: 


Now,  similar  rules  exist  for  cases  where  root-of-Fj  is  or  ’*’  or  Slightly  different  forms 
are  needed  where  root-  of-  Fx  is  ’squareroot’  or  ’square’. 

Next,  consider  what  enters  in  applicability  conditions  for  elements  (moves)  of  the  Collect 


method.  The  current  expression  is  F,  and  Fx  is  a  subterm  (subtree)  of  F. 
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Fx  is  the  least  dominating  term  in  x  of  the  expression  F,  with  two  of  its  immediate  subterms 
(immediate  subtrees)  containing  x.  In  the  tree  representation,  these  subterms  are  shown  as  Fn, 
F 12;  and  F1S,  Fu  stand  for  terms  of  arbitrary  form.  Now,  if  N(x,  F)  >  1  :  main  condition 

(there  are  at  least  2  occurrences  of  x)  and  F ujti]  =  F^tx |  Then  the  following  distributive  axiom 


applies: 


(u  *  v)  +  (u  *  w)  =  u  *  (v  +  w) 


The  system  matches  Fn  [ex],  Fn  \tx\  to  u  and  the  effect  of  the  ’move’  application  is  as  follows: 


In  this  transition,  the  number  of  occurrences  of  x  decreases  --  which  is  the  goal  of  Collect 


Now,  the  key  problem  is  how  to  automate  the  formulation/discovery  of  applicability  condi¬ 
tions  for  moves  such  as  the  above  Isolate  and  Collect  moves. 

What  is  involved  in  approaching  the  problem  in  the  same  way  as  in  Meta-dendral  and 
LEXfSfj?  Briefly,  we  analyze  traces  of  solutions  (obtained  via  the  initial  weak  method).  For 
each  application  of  a  move  (i.e.,  application  of  an  algebraic  axiom)  we  retain  the  context  of  the 
expression  on  which  it  is  applied.  Then,  we  formulate  a  generalization  as  a  pattern  which  is  con¬ 
sistent  with  all  the  contexts  of  expression  associated  with  application  of  the  move.  To  obtain 
such  generalization  we  keep  a  version  space  of  generalization. 
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Upon  examination  of  the  Bundy  problem,  we  see  that  a  LEX-like  approach  to  learning 
strong  applicability  conditions  for  moves  is  possible  provided  that: 

(a)  We  have  an  ’appropriate'  representation  framework 
for  expressing  ’expression  contexts’  (in  terms  of 
which  applicability  conditions  are  to  be 
formulated),  and 

(b)  We  have  a  notion  of  a  ’generalization’  hierarchy 
to  help  us  formulate  patterns  in  a  set  of 
'expression  contexts'. 

It  appears  that  there  is  no  difficulty  in  defining  a  generalization  hierarchy  in  the  present 
domain.  The  definition  of  R-elementary  expressions  (p.  207  Bundy  paper)  provides  a  good  basis 
for  the  hierarchy;  but  some  changes  may  be  necessary.  More  work  is  needed  here. 

To  obtain  an  appropriate  representation  framework,  it  is  important  to  reason  back  from  the 
overall  goal  of  the  equation  solving  task.  We  can  find  concepts  such  as: 

-Number  of  occurrences  of  x  in  a  term, 

-term  -  subterm  relationship, 

-term  including  x, 

-notions  of  structural  simplicity  of  expressions 

in  the  initial  problem  formulation.  Work  is  needed  to  assess  the  difficulty  of  mechanizing  this 
choice  of  conceptual  framework  on  the  basis  of  transfer  from  the  problem  formulation. 

Note  that  a  LEX  approach  to  procedure  improvement  may  be  sufficient  in  this  case.  How¬ 
ever,  it  may  be  desirable  to  have  an  approach  that  leads  to  the  structural  description  of  methods 
presented  by  Bundy.  To  do  this,  we  need  to  be  able  to  discover  a  global  decomposition  of  stages 
in  solution  and  to  formulate  for  each  stage  a  micromove/ maneuver.  This  is  a  very  promising 
direction  of  research  on  expertise  acquisition  in  problem  solving.  More  research  is  needed  here. 
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3.3.  JASON-PRESS  System 

In  order  to  evaluate  the  approach  taken  by  Bundy  and  Welham,  in  their  construction  of  the 
PRESS  system,  we  attempted  to  reconstruct  part  of  PRESS  from  the  fragments  that  have  been 
published.  The  part  we  attempted  to  reconstruct  was  just  enough  to  do  simple  algebra  problems. 
A  listing  of  our  version  of  the  PRESS  system  is  given  in  the  appendix. 

The  reconstruction  was  largely  successful,  in  that  our  version  could  solve  many  difficult 
algebra  problems  and  all  but  one  of  the  examples  that  PRESS  could  solve.  This  is  the  equation: 

(  2C0*2(I^2*ln^1^  =  tf1!*) 

We  believe  with  the  addition  of  more  specialized  axioms  this  problem  could  probably  be  solved  by 
our  PRESS  system.  More  interesting  however,  we  have  tried  some  problems  that  we  feel  PRESS 
cannot  solve,  because  of  the  fundamental  structure  of  PRESS.  In  particular  PRESS  has 
apparently  only  ad-hoc  facilities  to  factor  equations,  even  when  it  would  be  simple  to  do  so.  For 
example  our  version  of  PRESS  [JASON-PRESSj  cannot  solve  the  simple  formula 

log*x-\ogex  =  0. 


Factorization  is  one  approach,  thus  transforming  the  equation  to: 

y= logfz  ;  y2-y=0. 

Then  again  transforming  to 


and  then  again  to 


z  =  ey  ;  y(y-i)=0. 


y=  0  ;  y  =  l  ;  x  =  ey . 


This  observation  led  to  a  new  idea  to  add  to  the  capability  of  PRESS.  This  is  to  add  is  the  max¬ 
imum  height  of  the  covering-tree  of  X.  The  reduction  method  is  to  reduce  the  maximum  height 
of  this  covering-tree  of  X.  For  the  example  shown  above,  the  tree  is: 
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The  maximum  height  of  this  tree  is  the  length  of  the  far  left  branch  (four  edges).  The 
transformed  covering-tree  (set)  is: 


/  \ 
Y  0 


The  maximum  tree-height  is  now  only  two  links.  Although  we  have  not  taken  the  time  to  imple¬ 
ment  the  FACTOR  solution  method  in  JASON-PRESS,  it  seems  clear  that  it  would  not  only 
allow  a  wider  range  of  equations  to  be  solved,  but  would  provide  a  very  clean  method  of  solving 
the  one  equation  given  by  Bundy  and  Welham  that  our  version  of  PRESS  could  not  solve. 

We  learned  a  great  deal  in  implementing  and  experimenting  with  our  version  of  PRESS. 
First,  we  found  the  power  of  Prolog  to  implement  Meta-rules  outstanding.  Second,  the  power  of 
the  meta-rules  to  improve  performance  was  impressive.  Third,  the  difficulty  of  implementing  real 
programming  systems  is  still  with  us.  Finally  we  gained  much  insight  into  the  possibility  of 
automatically  producing  strong  programs  from  weak  methods. 


3.4.  General  Strength  Improvement  Scheme 

We  now  wish  to  work  out  a  general  strength  improvement  scheme  from  what  we  have 
learned  from  our  experimentation  with  our  PRESS  system.  PRESS  solves  equations  in  ordinary 
algebra.  This  algebra  has  a  formal  axiomatic  definition.  Table  2  illustrates  a  suitable  fundamen¬ 
tal  set  of  axioms  for  algebra,  and  in  the  form  of  Prolog  facts.  It  is  easily  read.  For  example  the 
entry  axiom(5,A+  B,B+  A)  says  that  this  is  the  5th  axiom  and  that  the  term  A+  B  may  be 
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replaced  by  the  term  B+  A.  A  and  B  may  represent  any  simple  or  complex  expression. 

The  weak  solution  method  is  very  simple,  but  very  general.  Lemmas  are  derived  from 
axioms  until  one  is  found  that  will  convert  a  general  equation,  into  the  form: 

x  =  Expression  (not  containing  x). 

Now  we  will  derive  a  Prolog  representation  of  this  concept. 

It  is  true  that  we  can  solve  Expression  with  respect  to  X  for  an  Answer  IF  it  is  true  that  there  is  a  * 
lemma  that  converts  Expression  to  an  answer  of  the  form  z  =  Exp  AND  it  is  true  that  there  is 
no-occurrence  of  Xin  Exp.  In  Prolog  this  statement  is: 

solve(Expression,  X,  Answer)  IF 
lemma(Expression,  X  =  Exp)  AND 
no-occurrence(X,  Exp). 

Tbe  definition  of  the  lemma  concept  contains  the  search  component.  A  lemma  transforms  an 
Expression  to  a  Result  IF  there  is  an  axiom  that  relates  the  Expression  and  the  Result.  In  Prolog 
this  is: 

lemma(Expression,  Result)  IF  axiom(Expression,  Result). 

Also  a  lemma  transforms  an  Expression  to  a  Result  IF  there  is  an  axiom  that  transforms  Expres¬ 
sion  to  the  Answer  AND  a  lemma  that  transforms  the  Answer  to  the  Result.  In  Prolog  this  is: 

lemma(Expression,  Result)  IF 
axiom(Expression,  Answer)  AND 
lemma(Answer,  Result). 

The  concept  of  a  transformation  axiom  is  a  simple  statement  of  fact: 
axiom  (A*B,  B*A). 

There  are  about  two  dozen  such  axioms  needed  for  ordinary  algebra.  Table  2  illustrates  these  and 
the  ’structural’  axioms  to  be  discussed  next. 
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Table  2.  Fundamental  axioms. 

%  Equality  axioms 
axiom(0,A=B,B=A). 

%  Addition  axioms 

axiom(l,A+ B,B+ A).  %  Commutation 

axiom(2,A+  (B+  C),(A+  B)+  C).  %  Distribution 
axiom(3,A+  0,A).  %  Existence  of  Zero 

axiom(4,A+  (-A),0).  %  Existence  of  Negative 

%  Multiplication  axioms 
axiom(5,A*B,B*A).  %  Commutation 

axiom(6,A*(B*C),(A*B)*C).  %  Distribution 

axiom(7,A*l,A).%  Existence  of  One 
axiom(8,A*A*(-l),l)  >  A  ==0.  %  Existence  of  Inverse 
axiom(9,A*(-l),l/A).  %  Definition 

%  Distribution 

axiom(10,A*B+  A*C,A*(B+  C)).  %  Add/Mult  Distribution 
%  Exponentiation 

axiom(lllA*X*B‘X,(A*B)"X).  %  Distribution 

axiom(12,A‘X*A‘Y,A‘(X-f  Y)).  %  Distribution 

axiom(13,A*l,A).  %  Existence 

axiomj  14,0“ 0,1)-%  Definition 

axiom)  15, log(e‘X),X).  %  Existence  of  Inverse 

axiom(16,root(X,A"X),A).  %  Existence  of  Inverse 

axiom(17,root(2,X),sqrt(X)).  %  Definition 

%  Trig 

axiom)  18, sin(0),0).  %  Definition 

axiom)  19, sin(pi),0).  %  Definition 

axiom(20,cos(0),  1).  %  Definition 

axiom(21,cos)pi),-l).  %  Definition 

axiom(22,sin(A)‘2+ cos(A)*2,l).  %  Definition 
axiom(23,sin(A)*cos(A)‘(-l),tan(A)).  %  Definition 
axiom(24,sec(A)A(-l),sin(A)).  %  Definition 
axiom(25,cosec(A)*(-l),cos(A)).  %  Definition 
axiom(26,cotan(A)‘(-l),tan(A)).  %  Definition 


axiom(27,arcsin(sin(A)),A).  %  Existence  of  Inverse 

axiom(28,arccos)cos(A)),A).  %  Existence  of  Inverse 

axiom(29,arccosec(cosec(A)),A).  %  Existence  of  Inverse 

axiom(30,arctan(tan(A)),A).  %  Existence  of  Inverse 

axiom)31,arcsec(sec(A)),A).  %  Existence  of  Inverse 

axiom(32,arccot)cot)A)),A).  %  Existence  of  Inverse 


%  Structure 

axiom(|i,N],  E,A)  >  axiom(N,A,E). 
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axiom(N,  E,A)  :- 
E=..[Op,L  ,RJ, 

((not  atojnic(L),axiom(N,L,NL),  A=..[Op,NL,  R|); 
(not  atomic(R),axiom(N,R,NR),  A=..[Op,L  ,NRj)). 


axiom([f,Op],A=B,NA=NB) 

member(Op,j+  log, root]), 

(NA=..(Op,A,X), 

NB=.  [Op,B,X|); 

member(Op,[-,sqrt,sin,  cos,  tan,  sec,  cosec,  cotan, 
arcsin.arccos.arctan.arcsec, 
arccosec  .arccotan] ), 

(NA=..|Op,A], 

NB=..[Op,Bj). 


The  first  structure  axiom  is  the  statement  that  axioms  can  be  applied  in  either  direction. 
While  some  axioms  are  their  own  inverse,  some  are  not.  Therefore  for  some  axioms,  we  need  to 
provide  an  inverse;  In  Prolog  this  is: 

axiom  (x,y)  IF  axiom  (y,x). 

Thus  either  l)the  above  statement,  can  be  included  in  the  axiom  set,  or  2)  only  the  inverses  of 
those  axioms  to  which  it  applies  can  be  included  in  the  axiom  set.  For  example, 
axiom(A+  (B+  C),  (A+  B)+  C)  needs  an  explicit  inverse  in  the  form: 

axiom((A+  B)+  C,A+  (B+  C)),  while  axiom(A+  B,B+  A)  needs  no  explicit  inverse  (it  is  its  own 
inverse).  Table  2  illustrates  the  use  of  the  first  approach. 

In  the  algebra  systems  we  wish  to  examine,  algebraic  structures  are  built  up  from  atomic 
symbols  such  as  {’3’,  'x',  ’father'},  and  operators  such  as  {=,-,*,/}.  Axioms  and  lemmas  apply  to 
both  whole  expressions  and  the  non-atomic  components  of  the  expressions.  For  example  the 
axiom  A+  B  <  =  >  B+  A  can  be  applied  to  x*(a+  b)  to  yield  x*(b+  a).  Therefore  it  is  an  axiom 
that  an  axiom  can  also  be  applied  to  components  of  an  expression.  Also  then,  an  axiom 
transforms  an  Expression  to  a  Result  IF  the  Expression  is  parsed  into  an  Operator,  a  Left  part 
and  a  Right  part  AND  either  the  Left  part  is  not  atomic  AND  an  axiom  can  transform  the  Left 
part  into  a  New  Left  part  AND  the  Answer  can  be  constructed  OR  the  Right  part  is  not  atomic 
AND  an  axiom  can  transform  the  Right  part  into  a  New_Right  part  AND  the  Answer  can  be  con¬ 
structed.  In  Prolog  this  can  be  added  to  our  other  definitions  of  axioms  as  illustrated  in  Table  2. 
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In  Prolog  it  is  customary  to  compress  the  notation  and  use  in  place  of  IF,  in  place  of  AND, 
and  in  place  of  OR.  We  will  switch  to  this  notation  below. 


3.S.  Results 

A  Prolog  query  to  solve  a  very  simple  equation  z-a=0  is:  8ot'vc(x-a=0,z, Result).  However 
for  our  weak  system,  the  full  axiom  set  is  too  big  for  our  weak  system  to  explore  in  a  reasonable 
time.  Therefore  we  have  removed  all  axioms  except  those  that  apply  to  structure,  equality,  addi- 
tion  and  subtraction.  The  resulting  simplified  program,  but  modified  to  produce  a  proof,  follows. 


p  prf(Proof), 

solve(Proof,  x-a=0,x, Result), 
nl,write('Result:  ’), 
write(Result),nl, 
write(’  Proof:  ’), 
write(Proof),nI. 

Prf(l  %  This  is  a  hint  consisting  of  the  first  two  steps 

Ip, 1,3], 

[p,l,[p,r,4]] 

IT  |). 

so!ve(P,E,X,X=A)  Iemma(P,E,X=A),no_occurrence(X,A).  %  Solve  procedure 


*  v 

5 


*4 

& 

H 

$ 

iCv 


■H 


lemma([N|,E,A)  :-  axiom(N,E,A).  %  Lemma  procedure 

lemma((N|M],E,A)  :-  lemma(M,E,T),axiom(N,T,A). 


■•Vi 


no_occurrence(X,N). 


vv 
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no_occurrence(X,  [[]] ). 

no_occurrence(X,H)  atomic(H),  X  ==  H,!. 
no_occurrence(X,Exp)  Exp  =..  [Op,  Y|T], 
no_occurrence(X,  Y),!, 
no_occurrence(X,  T),!. 

%  Fundamental  axioms  limited  to  + 

%  Equality  axioms 
axiom(0,A=B,B=A). 

%  Addition  axioms 

axiom(l,A+ B,B+ A).  %  Commutation 

axiom(2,A+  (B+  C),(A+  B)+  C).  %  Distribution 

axiom(i2,(A+  B)+  C,A+  (B+  C)).  %  Distribution 

axiom(3,A+  0,A).  %  Existence  of  Zero 

axiom(i3,A,A+  0).  %  Existence  of  Zero 

axiom(4,A+  (*A),0).  %  Existence  of  Negative 

axiom(i4,0,A+  (-A)).  %  Exbtence  of  Negative 

axiom(n,(A  -  B),  (A  +  (-B))  ).  %  Definition  of  Negative 

axiom(in,(A  +  (-B)),  (A  -  B)  ).  %  Definition  of  Negative 

axiom(N,E,A)  axiomp(N,E,A). 

axiomp(|p,l,Nj,  E,A)  !,  E=..(Op,L,R|, 

not  atomic(L),axiom(N,L,NL),  A=..(Op,NL,  R). 

axiomp([p,r,N),  E,A)  !,  E=..|Op,L,R], 

not  atomic(R),axiom(N,R,NR),  A=..[Op,L  ,NRj. 


“ — ™ we,wwwgBIBW^ W.I  i'i  vxm  V  rxrnn  .unrj  ffr 


"Vw  .  ij i , „  , 
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solve(N,b*(a-x)=c,x,A), 

write(’  The  answer  is  :  ’),  write(A), 
write(’  Proof:  *),write(N),nl. 

solve(P ,E,X,X=:A)  lemma(P,E,X=A). 

lemma([N] ,E,A)  ax(N,E,A). 
lemma([N|M],E,A)  lemma(M,E,T),ax(N,T,A). 

ax(N,E,A)  not  atomic(E),  axiom(N,E,A). 

axiom(l,  A=B,B=A). 
axiom(2,  A*B,B*A). 
axiom(3,  A+  B,B+  A), 
axiom (4  ,(A-B),(-(B)+ A)), 
axiom (5  ,A+  B=0,A=(-(B))). 
axiom(6,  A-B=0,A=B). 
axiom(7,  A*B=1,A» B*(-l)). 
axiom(8,  A+  B=C,A=C-B). 
axiom(9,  A-B=C,A—C+  B). 
axiom(10,A*B=-C,A=C/B)  >  not  B  =  0. 
axiom(  11,  A*(B-C),A*B-A*C). 
axiom(  12,  A*(B+  C),A*B+  A*C). 

axiom(N,  E,A) 

E»..[Op,L,R), 

!(ax(N,L,NL),  A=..[Op,NL,  R|); 
(ax(N,R,NR),  A-..|Op,L  ,NR])). 


» 


■,  V  *  *  •  -  - 


,  r 


v  *-V- 


-V. 
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This  improved  system  is  much  more  powerful.  It  immediately  solves,  un-aided,  the  expression  x- 
a=0,  and  is  capable  of  solving  much  more  complicated  expressions  such  as  b*(a-x)=c  in  just  a 
few  seconds.  The  power  of  the  'derived'  axioms  used  here  is  very  apparent.  In  particular  the 
axioms  (A-B=0,  A=B)  was  especially  useful.  It  is,  of  course,  just  an  axiomatization  of  the 
results  of  solving  our  original  example  equation.  This  result,  leads  to  the  idea  of  automatically 
generating  a  stronger  method  from  a  weak  one. 

3.7.  Generating  a  Strong  Method 

We  have  gained  some  insight  from  the  above  described  experiment  and  our  experience  with 
the  PRESS  system. 

(1)  Powerful  methods  require  powerful  rules  (axioms). 

(2)  Powerful  rules  can  be  derived  from  fundamental  ones,  that  may  be  weak. 

(3)  When  humans  solve  equations,  they  only  apply  the  axioms  that  will  be  likely  to  lead  to  a 
solution.  The  PRESS  system  mimics  this  idea  with  its  metarules  that  control  the  applica¬ 
tion  of  axioms. 

Our  idea  of  automatically  generating  a  strong  method  form  a  weak  one  is  a  generalization  of 
the  method  used  in  PRESS.  It  will  also  result  in  a  somewhat  weaker  system  than  PRESS,  as  it 
will  not  have  the  extensive,  by  hand,  fine-tuning  that  has  been  devoted  to  the  PRESS  system. 
The  method  follows  in  the  next  four  subsections. 

3.7.1.  Metrics 

First  we  define  a  set  of  metrics  that  estimate  the  distance  an  expression  is  from  a  solution, 
in  each  of  the  corresponding  dimensions.  These  metrics  are  summarized  in  Table  3. 


TABLE  3:  Axiom  Sets  for  PRESS. 


EXAMPLE 


A/B  — >  A*B‘(-1) 


SET 

NAME 

METRIC 

ggjgjgggl 

#  of  operator 
symbol  types 

TIDY* 

#  of  total 
occurrences  of 
all  operators 

ATTRACT* 

Size  of  the 
covering  tree 
of  X 

COLLECT* 

#  of  occurrences 
of  X 

ISOLATE* 

The  distance  of  X 
from  the  left 
side  of  expression 

FACTOR 

Max.  depth  of  the 
covering  tree  of  X 

log(X)  +  log(X)  ->  log(X*X) 


X  +  X  ->  2*X 


*  These  sets  were  originally  designed  by  hand  by 
Bundy  and  Welham  for  the  PRESS  system. 


All  but  the  last  one  were  inspired  by  the  PRESS  System  of  Bundy  and  Welham[23j  (The  PRESS 
paper).  All  the  metrics  are  explained  above.  For  each  metric  we  must  write  a  Prolog  procedure 
to  determine  if  one  expression  is  an  improvement  over  another  expression,  relative  to  the  metric. 
For  example  consider  the  tidy  metric.  Its  metric  is  the  total  number  of  operators  that  occur  in  an 
expression.  A  Prolog  procedure  to  measure  this  Cost  of  an  Expression  is: 
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The  first  two  lines  assign  zero  cost  to  atoms  and  variables.  The  remainder  count  operators.  The 
improvement  procedure  (for  tidy)  can  now  be  written  as: 


improved(E,R)  >  cost(E,Ce),  cost(R,Cr),  Ce>Cr,!. 


3.7.2.  Selecting  Lemmas 

Second,  We  generate  lemma  for  the  fundamental  axioms  as  described  above,  and  select  only 
those  that  can  improve  an  expression  more  than  those  that  have  already  been  selected.  We  call 
the  survivors  axioms  but  also  signify  to  which  set  they  belong.  Thus  axioms  generated  to  tidy 
expressions  and  called  tidy_axioms.  Since  the  fundamental  axioms  are  bi-directional  in  their 
application,  the  fundamental  lemma  generation  process  will  itself  generate  potential  expressions 
for  potential  improvement.  There  is  a  recursive  ’boot-strapping’  process  going  on  here.  It  must 
start  somewhere.  Therefore  a  ’starter’  axiom  must  be  provided.  For  the  tidy  axioms,  this  will  be 
the  null  tidy_axiom(E,E). 

This  technique  has  some  similarity  to  the  Knuth-Bendix  procedure[25]  and  to  the  ’chunking' 
ideas  of  Newell,  Rosenbloom  and  Laird[26, 27). 

3.7.3.  Structure  Axioms  Third,  we  will  rewrite  the  fundamental  structure  axiom  to  only 
employ  the  new  special  set  of  axioms.  For  the  tidy  axioms,  we  will  call  this  procedure  ’tidy’.  How¬ 
ever  it  must  do  more  than  just  find  some  improvement  in  an  expression.  We  want  it  to  find  all 
possible  improvement.  Thus  it  will  not  only  apply  axioms  to  the  components  of  an  expression,  but 
to  the  composite  constructions  as  well.  This  procedure  is  very  similar  to  the  tidy  procedure  of 
PRESS  as  it  appears  in  the  appendix.  In  Prolog  it  is: 
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tidy(U,U) var(U),!. 
tidy(U,Z)  tidyax(U,V), 

(  (  spIit_exp(V,Op,L ,R), 
tidy(L,NL),  tidy(R.NR), 

W  =..[Op,NL,NR]  ); 

(  V  =..  [Op,R], 
tidy(R,NR), 

W  =..  |Op,NR|  )  ), 

tidyax(W,Z), 

improved(U,Z),!. 

tidy(U,V)  tidyax(U,V),improved(U,V),!. 
tidy(U,U). 


3.7.4.  Arithmetic  Fourth,  We  will  provide  aa  axiom  to  do  arithmetic.  This  will  allow  expres¬ 
sions  such  as  1+  1+  1  to  be  reduced  to  a  single  number  (  3  in  this  case).  Obtaining  a  powerful 
such  a  addition  reduction  of  number  is  not  at  all  simple,  if  it  must  he  derived  from  fundamental 
arithmetic  axioms.  However  it  is  of  course  very  important  for  computing  and  is  hard-wired  into 
every  modern  digital  computer.  So  we  will  adopt  numeric  reductions  without  resorting  to  funda- 
mentaF axioms.  In  Prolog  the  axiom  is: 

tidyax(Exp,Rslt)  :-  Exp  =..|Op,Al,A2], 
number(Al), 
number(A2), 
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tidy_ax(arcsec(X)=arcsec(Y),X=Y). 

tidy_ax(arctan(X)=arctan(Y),X=Y). 

tidy_ax(arccos(X)=arccos(Y),X=Y). 

tidy_ax(arcsin(X)=arcsin(Y),X=Y). 

tidy_ax(cotan(X)=cotan(Y),X=Y). 

tidy_ax(cosec(X)=cosec(Y),X=Y). 

tidy_ax(sec(X)=sec(Y),X=Y). 

tidy_ax(tan(X)=tan(Y),X=Y). 

tidy_ax(cos(X)=cos(Y),X=Y). 

tidy_ax(sin(X)— siD(Y),X=Y). 

tidy_ax(sqrt(X)== sqrt(Y),X=Y). 

tidy_ax(-X=-Y,X=Y). 

tidy_ax(root(X,Y)=root(Z,Y),X=Z). 

tidy_ax(Iog(X,Y)=log(Z,Y),X=Z). 

tidy  _ax(X  *  Y=  Z "  Y,X=  Z). 

tidy_ax(X/Y=Z/Y,X=Z). 

tidy_ax(X*Y=Z*Y,X=Z). 

tidy_ax(X-Y=Z-Y,X=Z). 

tidy_ax(X+  Y=Z+  Y,X=Z). 

tidy_ax(arccot(cot(X)),X). 

tidy_ax(arcsec(sec(X)),X). 

tidy_ax(arctan(tan(X)),X). 

tidy_ax(arccosec(cosec(X)),X). 

tidy_ax(arccos(cos(X)),X). 

tidy_ax(arcsin(sin(X)),X). 

tidy_ax(cotan(X)*-l,tan(X)). 

tidy_ax(cosec(X)  -1  ,cos(X)). 


Radical  Computing  III 


34 


Our  general  methodology  can  be  summerized  as  follows: 

(1)  Define  metrics  that  estimate  the  distance  to  a  solution  in  each  dimension  of  the  problem 
space. 

(2)  Generate  specialized  sets  of  axioms,  selected  by  each  metric. 

(3)  Apply  each  set  on  turn  to  the  expression  to  be  solved  so  the  application  of  each  set  pro¬ 
vides  suitable  expressions  for  the  next  set  to  be  applied.  See  the  PRESS  system  for  details 
of  this. 

It  does  seem  feasible  to  automatically  generate  strong  methods  from  weak  ones  in  some 
domains,  at  least  in  ordinary  algebra.  At  the  moment  we  do  not  have  such  a  system,  but  it  seems 
reasonable  to  expect  such  a  system  could  be  made  to  work.  Clearly  much  remains  undone. 

The  potential  applications  of  these  ideas  include  not  only  algebraic  manipulation,  but  more 
complex  systems  as  well.  For  example,  if  one  starts  from  an  axiom  set  for  a  Galois  field  then  it 
may  be  possible  to  derive  some  interesting  methods  for  dealing  with  cipher  problems,  etc.  Simi¬ 
larity,  if  complex  arithmetic  is  added,  then  some  interesting  methods  for  solving  signal  processing 
problems  may  be  possible. 

One  view  of  the  method  is  that  it  is  a  ’super-compiler’.  It  is  clear  that  in  the  future,  the 
general  approach  presented  here  could  have  significant  impact  on  software  development  tech¬ 
niques. 
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APPENDIX  A-l 
PRESS 

%  Main  PRESS  system  Despain  August  84 

consult(normalize), 

consult(solve), 

reconsult(display). 

%  Test  of  press... 

tp  reconsult(pre&s_tests), 

nl,  t(N,  Exp),  write(’CASE:  ’), 
write(N),nl,  press(Exp,  x,  Ans), 
display(Exp,  Ans),  fail. 

%  Main  PRESS  procedure. 

press(Exp,  X,  Ans)  tidy (  Exp,E),  normalize(E,  Rslt),  solve(Rslt,  X,  Ans), 
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%  Normalize  Procedure.  Despain,  November  1984. 


normalize(U,V)  removeall(U,  W),!,restrictall(S,  W,  V),!. 

removeall(U,Z)  remove(U,V),  V  =..  [Op|Args], 
remove_list(Args,Nargs), 

W  =..  [Op|Nargs], 
remove(W,Z). 

removeall(U,V)  >  remove(U,V). 
remove_list([],[|). 

removeJist([H|T],[Nh|Nt])  >  removeall(H,Nh),removeJist(T,Nt). 

%  Remove  axioms 
remove(A<B,  B>A). 
remove(A/B,  A*B"(-1)). 
remove(root(R,  X),  X"R‘(-1)). 
remove(A,  A). 


%  Restrict  procedure 

restrictall(S,  U,  Z)  >  restrict(S,  U,  V), 

V  =..  [Op|Args], 
restrict_list(S,  Args,  Nargs), 

W  =..  |Op|Nargs], 
restrict(S,  W,  Z). 

restrictall(S,  U,  V)  >  restricts,  U,  V). 

restrict_list(S,  ],  []). 

restrict_list(S,  H|T],[Nh|Nt])  restrictall(S,  H,  Nh), 

restrict_list(S,  T,  Nt). 


%  Restrict  axioms 
restrict(uminus,  -(U+  V),(-U)+  (-V)). 
restrict(_,  U,  U). 

%  Test  the  normalize  Procedure.  Despain,  November  1984 

t  >  reconsult(normalize),  reconsult(display),  tl,  td,  tu. 

tl  >  Exp  =  (0<(-(a+  b))), 

normalize(Exp,  Ans),  display(Exp,  Ans). 

td  >  Exp  =  a+  b/c, 

normalize(Exp,  Ans),  display(Exp,  Ans). 

tu  Exp  =  -(a+  b), 

normalize(Exp,  Ans),  display(Exp,  Ans). 
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%  Tidy  procedure.  Despain,  July  1984. 

tidy(U,Z)  tidyax(U.V),  V  =..  [Op|  Args] ,tidy  Jist(  Args.Nargs), 
W  =..  (Op|Nargs],tidyax(W,Z),!. 
tidy(U.V)  tidyax(U,V). 

tidy_list(  1,0). 

tidy_list{  H|T),[NhlNtJ) tidy(H,Nh),tidy  Jist(T,Nt). 


%  Tidy  axioms 
tidyax(U  =  U,true). 
tidyax(-(-U),U). 

tidyax(U"l,U). 

tidyax(l*U,l). 

tidyax(U'0,l). 

tidyax(0‘U,0)  number(U),  U  ==  0. 

tidyax(U*l,U). 

tidyax(l*U,U). 

tidyax(U*0,0). 

tidyax(0*U,0). 

tidyax(U+  0,U). 
tidyax(0+  U,U). 

tidyax(U-O.U). 

tidyax(O-U.-U). 

tidyax(0+  (-U),-U). 

tidyax(U-(-V),U+  (V)). 

tidyax(Iog(U,U),l). 

tidyax(log(U,l),0). 

tidyax(U*(V*W),U*V*W)  atomic(V). 
tidyax((U*V)*W,U*V*W)  >  atomic(W). 

%  This  axiom  does  simple  arithmetic  when  it  can  to  tidy  up  expressions. 
tidyax(Exp,Rs!t)  Exp  =s..|Op,Al,A2], 
number(Al), 
number(A2), 

Rslt  is  Exp. 

%  These  may  not  be  a  tidy  axioms  that  are  needed  to  solve  eqs, 

%  but  collect  axioms. 

tidy  ax  (X  *  X,  X‘2). 

tidyax(X  *  X*N,  X*M)  M  is  N+  1. 

tidyax(X‘N  *  X,  X‘M)  M  is  N+  1. 

tidyax(X‘N  *  (X),  X*M)  >  M  is  N+  1. 

tidyax(X‘L  *  X  N,  X‘M) M  is  N+L. 


%  These  may  not  be  tidy  axioms,  but  isolate  axioms. 
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%  Test  case. 

%  Tidy  axioms 
tidyax(U  =  U.true). 
tidyax(-(-U),U). 
tidyax(U‘  1,U). 
tidyaxjl  U,l). 
tidy  ax(U‘ 0,1). 

tidyax(O'U.O)  integer(U),  U  ==  0. 

tidyax(U*l,U). 

tidyax(l*U,U). 

tidyax(U*0,0). 

tidyax(0*U,0). 

tidyax(U+  0,U). 

tidyax(0+  U,0). 

tidyax(U-O.U). 

tidyax(0-U,(-U)). 

tidyax(U+  (-0),U). 

tidyax(0+  (-U).(-U)). 

%  It  is  not  clear  if  the  following  axiom  is  a  true  tidy  axiom  or  not. 
tidyax(U-(-V),U+  V). 


tidyax(U.U). 
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%  Isolate  procedure.  Despain  Nov  84 

%  isolate(Position,  Exp,  Ans). 
consult(isolax), 
reconsult(tidy), 
reconsult(display). 

%  Test. 

ti  reconsult(isolate_test),!,nl,  t(3),fail  ;  true. 

isolate(( l|T],  L  =  R,  A)  >  iso(T,  L  =  R,  A). 
isolate((2|T],  L  =  R,  A)  iso(T,  R  =  L,  A). 

%iso(U,V,X)  >  write(U), write]’  :  ’),write(V), write]’  :  ’),write(X),nl,fail. 
%  this  is  for  testing... 

iso( ),  X,  X). 

iso(  HJT],  FX  =  W,  Ans)  !, 

isolax(H,FX  =  W,  RHS,  Condition), 

Condition, 
tidy(RHS,  New),!, 
iso(T,  New,  Ans). 

%  Misc  functions 
arbint(nO). 

even(N)  integer(N),  0  is  N  mod  2. 
odd(N)  integer(N),  1  is  N  mod  2. 
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%  Isolate  procedure  testing.  Despain 

t(l)  Exp  =  (-sin(x)  =  y), 
isolate([l],  Exp,  Ansj, 
display(Exp,  Ans). 

t(2)  Exp  =  (sin(a)+  x=z), 
isolate([2],  Exp,  Ans), 
display(Exp,  Ans). 

t(3)  Exp  =  (log(e,xA2-l)=3), 
isolate([  1, 2,l,l|,  Exp,  Ans), 
display(Exp,  Ans). 

t(4)  Exp  =  (x*3+  y=z), 
isolate([l,l],  Exp,  Ans), 
disp!ay(Exp,  Ans). 


t(5)  Exp  =  (xA3=z), 

isolate([l],  Exp,  Ans), 
display(Exp,  Ans). 

t(6)  Exp  =  (x"4+y=z), 
isolate([l,l],  Exp,  Ans), 
display(Exp,  Ans). 

t(7)  Exp  ='  ((a+  x‘6)*y=z), 
isolate([l,2,l],  Exp,  Ans), 
display(Exp,  Ans). 


t(8)  Exp  =  (xAy=z), 

isolate([l|,  Exp,  Ans), 
display(Exp,  Ans). 


Nov  84 
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%Solve  Procedure  Despain  Nov  84 

reconsult(tidy),  reconsult(isolate),  reconsult(singIeocc), 
reconsult(no_occur),  reconsult(position), 
reconsult(collect),  reconsult(attract),  reconsult(closeness). 

%  Solve  procedure  testing.  Despain  Nov  84 

ts  reconsult(display),  reconsult(solve_tests), 
nl,  t(N,  Exp), 

write(’CASE:  ’),  write(N),nl,  solve(Exp,  x,  Ans), 
display(Exp,  Ans),  fail. 

%  Main  solve  procedure. 

solve(L  =  R,  X,  Ans) 

singleocc(X,  L  =  R),!, 
position(X,  L  =  R,  P),!, 
isolate(P,  L  =  R,  Ans). 

solve(Eqn,  X,  Ans)  > 

collect(X,  Eqn,  New),!, 
solve(New,  X,  Ans). 


solve(Eqn,  X,  Ans) 

attract(X,  Eqn,  New), 
closeness(X,  Eqn,  EC), 
closeness(X,  New,  NC), 
EC  >  NC,  !, 
solve(New,  X,  Ans). 

%  Solve  procedure  testing. 
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consult(tidy),  consult(isolate),  consult(solve), 
consult(singleocc),  consult(no_occur),  consuit(position), 
consult  (display). 

t  nl,  t(N,  Exp),  write(’CASE:  ’),  write(N),nl,  so!ve(Exp,  x,  Ans), 
display(Exp,  Ans),  fail. 

t(l,(-sin(x)  =  y)). 
t(2,(sin(a)-f  x=z)). 
t(3,(sin(a)+  x=z)). 
t(4,(x*3+y=z)). 
t(5,(x‘3=z)). 
t(6,(x'4+y=z)). 
t(7,((a+x‘6)*y=z)). 
t(8,(x‘y=z)). 
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%  Singleocc  procedure 

singleocc(X,(H|T])  singleocc(X,  H),!, 
no_occurence(X,  T). 

singIeocc(X,(H|T|)  no_occurence(X,  H),f, 
singleocc(X,  T). 

singleocc(X,X). 

singleocc(X,Exp)  Exp  [Op,  Y|T], 
singleocc(X,  Y), 
no_occurence(X,  T). 
singleocc(X,Exp)  Exp  =..  [Op,  Y|T], 
singleocc(X,  T), 
no_occurence(X,  Y). 

%  Test  of  singleocc  procedure 

t  reconsult(singleocc),  reconsult(no_occur),tt. 

tt  >  singleocc(x,  [g(y,a+  b,[e,x,d]),c]). 
tf singleocc(x,  g(y,a+  b,x),x). 
tn  singleocc(x,  g(y,a+  b,z),c). 

singleocc(X,[H|T[)  singleocc(X,  H),!, 
no_occurence(X,  T). 

singleocc(X,[H|T[)  no_occurence(X,  H),!, 
singleocc(X,  T). 

singleocc(X,X). 

singleocc(X.Exp)  Exp  =..  [Op,  Y|T], 
singleocc(X,  Y), 
no_occurence(T,  Y). 
singleocc(X.Exp)  Exp  — [Op,  Y|T], 
singleocc(X,  T), 
no_occurence(X,  Y). 

%  no_occurence  procedure 
no_occurence(X,[|). 

no_occurence(X,H)  atomic(H),  X  ==  H. 
no_occurence(X,[H|T[)  no_occurence(X,  H),!, 
no_occurence(X,  T). 

no_occurence(X,Exp)  Exp  =..  [Op,  YjTj, 
no_occurence(X,  Y),!, 
no_occurence(X,  T). 
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%  Position  procedure.  Despain,  July  1984. 

%  Singleocc  procedure  must  proceed  this,  as  it  will  wrongly  report  if 
%  there  is  no  occurance  of  X. 


%  Position(Symbol,  Expression,  Index). 


position(X, 

U 

!)• 

position(X, 

X] 

-|1] 

)• 

position(X, 

X| 

T 

|  > 

1 

position(X, 

H| 

T 

|  > 

1 

position(X, 

H| 

T 

1 1 

P 

)  >  no_occurence(X,  T). 

Qj)  >  no_occurence(X,  T),  position(X,H,Q). 
|Qj)  no_occurence(X,  H),  position(XfT,[S|Q|), 
P  is  S  +  1. 


position(X,X,[lj). 

%  position(X,X,[|).  THIS  ORIGINAL  DEFINITION  SEEMS  WRONG 

position(X,Exp,Pos)  Exp  =..  [Op|List], 

Op  ==  .,!, 

position(X,List,Pos). 

%  Test  of  Position  procedure.  Despain,  July  1984. 


nox(Exp)  no_occurence(x,Exp). 
po(Exp)  position(x,Exp,N),print(N). 

t  reconsult(no_occur),  reconsult(position),  reconsult(display),!, 
nl,  t(N,  Exp),  writej’CASE:  '),  write(N),nl,position(Exp,Ans), 
display(Exp,  Ans),  fail. 

position(Exp,  Ans)  position(x,  Exp,  Ans),!. 

t(false,[x,[a,[x]|,c]). 

t(true,[y,[a,[x]],c]). 

t(l,(-sin(x)  —  y)). 

t(2,(sin(a)+  x=z)). 

t(3,(sin(a)+  x=z)). 

t(4,(x*3+y=z)). 

t(5,(x‘3=z)). 

t(6,(x‘4+y=z)). 

t(7,((a+x'6)*y=z)). 

t(8-(x‘y=i)). 

t(none,  (y ,[a,(w]],c])  write(’Note  the  false  answer  here’),nl. 
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display(Exp,  Ans) 

print(’  Input  Expression  is  :  '),  print(Exp),ni, 
print('  The  Solution  is  :  '),  print(Ans), nl.nl. 


%  Test  Equations  for  testing  PRESS  procedures.  Despain  August  84 

%  Test  expressions:  t(Name,  Expression). 

t(0,(x+  x=a-b)). 

t(l,(-sin(x)  =  y)). 

t(2,(cos(a)+  x=z)). 

t(3,(log(e,x‘2-l)=3)). 

t(4,(x*3+y=z)). 

t(5,(x*3=z)). 

t(6,(x*4+y=z)). 

t(7,((a+x‘6)*y=z)). 

t(8,(x‘y=z)). 

%  Tests  of  Bundy  and  Welham. 
t(9,  (log(e,(x+  l)*(x-l))=3)). 
t(10,  (log(e,x+  1)+  log(e,x-l)=3)). 
t(ll,  (2-(x-2)  (x'3)=2)). 

t(12,  ((2  (cos(x)'2)*2‘(sin(x)*2))*cos(x)  =  2"(l/4))). 

%  Collect  procedure  Nov  84 

%  Warning,  dont  try  to  backtrack  through  this  procedure. 

reconsuit(tidy), 

reconsult(con  tains), 

reconsult(match), 

reconsult(collax), 

reconsult(least_dom), 

reconsult(no_occur), 

reconsult(display). 

tc  reconsult(coHect_tests),  nl, 
t(N,  Exp), 
write(’CASE:  ’), 
write(N),nl, 
write( 'Input:  ’), 
write(Exp),nl, 
collectx(x,  Exp,  Ans), 
display(Exp,  Ans),  fail. 

tell  :•  t(ll,E),  collect(x,E,A),  nl,display(E,A),nl. 
tc3  :-  t(3  ,E),  collect(x,E,A),  nl,display(E,A),nl. 

collectx(X,  Old,  New)  :-  collect(X,  Old,  New),  !. 

collect(X,  Old,  New)  :- 

contains(X,  Old), 
least_dom(X,  Old), 
collax(U,  LHS,  RHS), 
match(OId,  LHS), 
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contains(X,  U), 
tidy(RHS,  New). 

collect(X,  Old,  New) 

contains(X,  Old), 

Old  =..|Op,AH|AT], 

(  (  (  collect(X,  AH,  NH), 

New  =..|Op,  NHJAT]  ); 

(  collect(X,  AT,  NT), 

New  =..[Op,  AH|NT|  )  ); 

(  AH  =..[0p,  Hj(T3j],  %  This  solves  the  problem  created  by  the 
T  =..  [Op,T3|AT],  %  fact  that  expressions  are  scanned 

(  collect(X,  T,  NT),  %  backward  when  Op’s  are  the  same. 

New  —..[Op,  H,NT]  );  %  Ex:  x*2*x*x  is  collected.. 

(  collect(X,  H,  NH), 

New  =..(Op,  NH|T|  )  )  ). 

%  (  AT  =..[Op,  Hl|Tl),  %  It  is  not  clear  that  this  case  ever 

%  H  =..  [Op,  AH|H1],  %  occurs,  so  it  may  be  useless  work. 

%  (  collect(X,  H,  NH), 

%  New  =..[Op,  NH|Tl]  ); 

%  (  collect(X,  Tl,  NT), 

%  New  =..|Op,  H|NT]  )  )  ). 
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%  Test  of  collect  procedure.  Despain  August  84 

tc  >  t(10,E),  collect(x,E,A),  nl,write(A),nl. 

t  :•  reconsul t(collect),  reconsult(tidy), 

reconsult(contains),  reconsult(match),reconsult(collax), 
reconsult(least_dom), 

%  reconsult(test_eqns), 
reconsult(display),!, 

nl,  t(N,  Exp),  write(’CASE:  '),  write(N),nl,  collectx(x,  Exp,  Ans), 
display(Exp,  Ans),  fail. 

collectx(X,  Old,  New)  collect(X,  Old,  New), !. 

%  Test  Equations  for  testing  Collect  procedure.  Despain  August  84 

%  Test  expressions:  t(Name,  Expression). 
t(0,(x+  x=a-b)). 
t(l,(-sin(x)  =  x)). 

%  Tests  of  Bundy  and  Welham. 
t(10,  (log(e,x+  1)+ log(e,x-l)=3)). 
t(9,  (log(e,(x+  l)*(x-l))=3)). 
t(ll,  (2'(x‘2)‘(x‘3)=2)). 

t(12,  ((2‘(cos(x)‘2)*2"(*-in(x)'2))'cos(x)  =»  2*(l/4))). 

%  Collect  axioms. 

%  special  collect  axiom... 
collax(X,  X‘X‘0  =  1,  X  =  1). 
collax(X,  X‘X*N  =  1,  (X  =  0  ;  X  =  1)). 

collax(X,  X  -  X,  0). 
collax(X,  X  +  X,  2*X). 

collax(X,  X  *  X,  X*2). 
co!lax(X,  X+  W*X,  X*(l+  W)). 
collax(X,  X*U+  X*V,  X*(U+  V)). 

collax(X,  X*X‘V,  XA(1+V)). 
collax(X,  X*U*X*V,  X*(U+  V)). 
collax(X,  U‘X*VX,  (U*V)*X). 
collax(X,  (X+U)*(X-U),  X‘2-U*2). 

bO. 

collax(X,  log(X,U)+  log(X,V),  log(X,U*V)). 

collax(X,  Iog(X,U)+  log(X,V),  log(X,U*V))  :-b0. 

collax(X,  log(X,U)-log(X,V),  log(X,U*V*(-l))). 

collax(X,  log(U,X)+  log(V.X),  log(E,X)*(log(U,E)+  log(V,E))). 

collax(X,  sin(X)‘2+  cos(X)‘2,l). 
collax(X,  sin (X)*cos(X) * (•  1  ),tan(X)). 
collax(X,  sin(X)*cos(X),sin(2*X)*2  ‘ (-1)). 
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%  Least_dominate  procedure. 

least_dom(X,  X‘Y  =  1)  !.  ^Special  case  for  x*x‘N=l  etc.... 

least_dom(X,  Exp) 

Exp  =..|Op,H|T], 
contains(X,  H), 
contains(X,  T). 

%  Contains  procedure.  contains(X,  Exp). 

contains(X,  X). 
contains(X,  Exp) 

Exp  ==  ID), 

Exp  =..|Op,H|Tj, 

(contains(X,  H); 
contains(X,  T)). 

%  Match  procedure. 

match(A,  A). 

match(A  =  B,  X  =  Y) crossmatch(A,  B,  X,  Y). 
match(A  *  B,  X  *  Y)  crossmatch(A,  B,  X,  Y). 
match(A  +  B,  X  +  Y)  >  crossmatch(A,  B,  X,  Y). 
match(A  -  B,  (-Y)  +  X  )  :-  crossmatch(A,  B,  X,  Y),!. 
match((-A)  +  B,  Y  -  (X))  crossmatch(A,  B,  X,  Y),!. 
match(U,  V)  > 

u  ==  ID], 

U  =..[Op,H|T], 

V  —  Kll, 

V  =..[Op,F|L|, 
match(H,  F), 
match(T,  L). 

match(U'l,  U). 
match(U,  U*l). 

crossmatch(A,  B,  X,  Y)  > 

((match(A,  X), 
match(B,  Y)); 

(match(A,  Y), 
match(B,  X))). 
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%  Attract  procedure 

reconsult(attrax), 

reconsu!t(least_dom), 

reconsult(match), 

reconsult  (con  tains), 

reconsult(closeness), 

reconsult(tidy), 

reconsult(display). 

%  Test  of  attract  procedure 

ta  reconsult(attract_tests), 
nl,  t(N,  Exp), 
write(’CASE:  ’), 
write(N),nl, 
attractx(x,  Exp,  Ans), 
display(Exp,  Ans), fail. 

ta22  >  t(22,  Exp),attract(x,  Exp,  Ans), 

nl,write(’ta22  Input:  '),  write(Exp),nl, 
nl,write(’ta22  Output:  ’),  write(Ans),nl. 

t(22,  (2"((xA2)"(x*3))=2*x"0)). 

ta21  :-  t(21,  Exp),attract(x,  Exp,  X), 

nl,write(’ta21  Input:  ’),  write(X),nl, 
nl,write(’ta21  Output:  ’),  write(X),nl. 

t(21i  (2‘((x"2)A(x*3))=2)). 

ta23  >  t(23,  Exp),attract(x,  Exp,  X), 

nl,write(’ta21  Input:  ’),  write(X),nl, 
nl,write(’ta21  Output:  ’),  write(X),nl. 

t(23,  (x*2*x*x=2)). 


attractx(X,  Old,  New) :-  attract(X,  Old,  New),  write('X  :  ’),  write(New),nl, 

attract(X,  Old,  New)  :- 

least_dom(X,  Old), 
attrax(Ulist,  LHS,  RHS), 
matchjoid,  LHS), 
checklist_contains(X,  Ulist), 

%  closeness(X,  LHS,  Sold), 

%  closeness(X,  RHS,  Snew), 

%  Snew  <  Sold  , 

tidy(RHS,New). 

attract(X,  Old,  New)  :- 
Old  ==  (], 

Old  ~  mi, 

Old  »..|Op,H|T], 
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((attract(X,  H,  NH), 
New  =..[Op,NH|T]); 
(attract(X,  T,  NT), 
New  =..[Op,H|NT])). 


attract(X,  L  =  R  ,  New) 
least_dom(X,  Old), 
attras(Ulist,  L  =  R,  RHS), 
match(01d,  LHS), 
checklist  contains(X,  Ulist),!, 
tidy(RHS^New). 


attras([U,V],U‘V  =  1,  U  =  1). 


%checklist_contains(X,  H|T  )  contains(X,H). 
%checklist_contains(X,  H|T  )  :•  check list_contains(X,  T). 


checklist_contains(X,  H))  contains(X,H). 

checklist_contains(X,  H|T])  contains(X,H),  checklist_contains(X,  T). 
%  Closeness  procedure, 
tel  >  reconsult(least_dom), 
reconsu  1 1  (con  t  ai  ns), 
closeness(x,a+  x=b*(c+  x),M), 
nl,  write(M),nl. 


closeness(X,  E,  V) close(X,  E,  V),!. 
close(X,  X,  0). 


c!ose(X,  Exp,  M) 

least_dom(X,  Exp), 
tree_size(X,  Exp,  _,  0,  M),!. 


close(X,  Exp,  M)  > 

Exp  ==  [ , 

Exp  ==  j  ]], 

Exp  =..(Op,H|Tl, 

((contains(X,  H),  close(X,  H,  M)); 
(contains(X,  T),  closejx,  T,  M)j). 


tree_size(X,  [I  ,  0,  Sizejn,  Sizejn)  !. 
tree_size(X,  |(||,  0,  Sizejn,  Sizejn)  !. 

%tree_size(X,  X,  1,  Sizejn,  Sizejn)  !. 
treejsize(X,  X,  1,  Sizejn,  Size_out)  Size_out  is  Sizejn  +  1,!. 
tree_size(X,  Y,  0,  Sizejn,  Sizejn  )  atomic(Y),!. 
tree_size(X,  Exp,  HX,  Sizejn,  Size_out)  > 

Exp  ==  I , 

Exp  »  ( ]], 

Exp  f=..[Op,H|T|, 
tree_size(X,  H,  FX,  Sizejn,  S), 
tree_size(X,  T,  GX,  S,  S2), 
or(FX,  GX,  HX), 

((Op  =  ’•’)-> 

(Size_out  is  S2)  ; 

(Size_out  is  S2  +  HX)). 


v  ~r  V  V-.’ ■  ’  f-.  '- 


immmm 


*•/ 
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%  Fundamental  axioms.  Suggested  by  Rothaus.  Despain,  July  84. 
%  Addition  axioms 

axiom(A+  B,B+  A).  %  Commutation 

axiom(A+  (B+  C),(A+  B)+  C).  %  Distribution 

axiom(A+  0,A).  %  Existence  of  Zero 

axiom(A+  (-A),0).  %  Existence  of  Negative 


%  Multiplication  axioms 
axiom(A*B,B*A).  %  Commutation 

axiom(A*(B*C),(A*B)*C).  %  Distribution 

axiom(A*l,A).  %  Existence  of  One 

axiom(A*A*(-l),l)  A  ==0.  %  Existence  of  Inverse 

axiom(A‘(-l),l/A).  %  Definition 


%  Distribution 

axiom(A*B+  A*C,A*(B+  C)).  %  Add/Mult  Distribution 


%  Exponention 
axiom(A"X*BAX,(A*B)‘X). 
axiom(A'X*A*Y,A'(X+  Y)). 


%  Distribution 
%  Distribution 


axiom(A‘lpA). 
axiom(0'0,l). 
axiom(log(e*X),X). 
axiom(root(X,A'X),A). 
axiom(root(2,X),sqrt(X)). 


%  Existence 
%  Definition 
%  Existence  of  Inverse 
%  Existence  of  inverse 
%  Definition 


%  Trig 

axiom(sin(0),0). 

axiom(sin(pi),0). 

axiom(cos(0),  1). 

axiom(cos(pi),-l). 

axiom(sin(A)'2+  cos(A)*2,l). 

axiom(sin(A)*cos(A)‘(-l),tan(A)). 

axiom(sec(A)~(-l),sin(A)). 

axiom(cosec(A)*(-l),co8(A)). 

axiom(cotan(A)‘(-l),tan(A)). 


%  Definition 
%  Definition 
%  Definition 
%  Definition 

%  Definition 
%  Definition 
%  Definition 
%  Definition 
%  Definition 


axiom(arcsin(sin(A)),A). 

axiom(arccos(cosi(A)),A). 

axiom(arccosec(cosec(A)),A). 

axiom(arctan(tan(A)),A). 

axiom(arcsec(sec(A)),A). 

axiom(arccot(cot(A)),A). 


%  Existence  of  Inverse 
%  Existence  of  Inverse 
%  Existence  of  Inverse 
%  Existence  of  Inverse 
%  Existence  of  Inverse 
%  Existence  of  Inverse 
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%  This  procedure  partially  compiles  axioms  at  the  1th  level 

la  listing(ax). 

p_ax(N)  M  is  N+  1, 
ax(N,Exp,Res), 
asserta((ax(M,Exp,Res))),  fail. 
p_ax(N) demolish((ax(N,Exp,Res) Body)). 
p_ax(N)  listing(ax). 

demolish(Clause)  retract(Clause),demolish(Clause). 


ax(0,Exp,Res)  axiom(Exp,Res),  improved(Exp,Res). 
ax(0,Exp,Res)  :•  axiom(Res,Exp),  improved(Exp,Res). 

ax(N,Exp,Res) M  is  N-1,M>0, 

Iemma(Exp,Nex)l 
ax(M,Nex,Res) , 
improved(Exp,Res), 

Exp  ==  Res,  %  Put  in  to  fix  a  bug.  Occur  check? 

not(axm(M,Exp,Res)). 

axm(M,Exp,Res)  ax(M,Exp,Res). 
axm(M,Exp,Res)  >  N  is  M-l,  N>0,  axm(N,Exp,Res). 

%-Co6t  of  expression  evaluator. 
cost(X,0)  atomic(X),!. 
cost(X,0)  var(X),!. 

cost(Exp,  Cost)  >  Exp  =..  [Op|Args],  cst(Op,Cl), 

list_cost(Args,C2),  Cost  is  Cl  +  C2,!. 


list_cost( ),  0). 

list_cost(  H|T|,  Cost)  cost(H,Cl),  list_cost(T,  C2),  Cost  is  Cl  +  C2. 


cst(=,l)  !. 
cst(+  ,1)  !. 

cst(-,l)  !. 
cst(*,l)  !. 
cst(M)  >  !. 
cst(/,l)  !. 
cst(_,0). 

%lemma(Exp,Res)  >  axiom(Res,  Exp). 
lemma(Exp,Res) Exp  =====  Res, 
axiom(Nex,  Exp), 

Nex  =..  [Op|Args], 
list_axioms(Args,Nargs), 
Res  *=..  (Op|Nargs). 

list_axioms([],(|). 

list_axioms([H|T|,(NH|NT|) H  —  NH, 
axiom(H,NH), 


%  Put  in  to  fix  a  bug.  Occur  check? 


%  Put  in  to  fix  a  bug.  Occur  check? 
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list_axioms(T,NT). 

improved(Exp,Res) cost(Exp,Cl), 
cost(Res,C2), 

C1>C2,!. 


axiom(A,A). 

%  Addition  axioms 

axiom(A+  B,B+  A).  %  Commutation 

axiom(A+  (B+  C)f(A+  B)+  C).  %  Distribution 

axiom(A+  0,A)-  %  Existence  of  Zero 

axiom(A+  (-A),0).  %  Existence  of  Negative 


%  Multiplication  axioms 
axiom(A*B,B*A). 
axiom(A*(B*C),(A*B)*C). 
axiom(A*l,A). 

axiom(A*AA(-l),l)  >  A  ==0. 


axiom(AA(-l),l/A). 

%  Distribution 
axiom(A*B+  A*C,A*(B+  C)). 

%  Exponention 
axiom(A"X*B"X,(A*B)'X). 
axiom(A'X*A*Y,A‘(X+  Y)). 
axiom(A‘l,A). 
axiom(00,l). 


%  Commutation 
%  Distribution 
%  Existence  of  One 

%  Existence  of  Inverse 
%  Definition 


%  Add/Mult  Distribution 


%  Distribution 
%  Distribution 
%  Existence 
Definition 


% 
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